Jonathan Pilault

The Zamba2 Suite: Technical Report

Nov 22, 2024

Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

Aug 09, 2024

Zyda: A 1.3T Dataset for Open Language Modeling

Jun 04, 2024

Zamba: A Compact 7B SSM Hybrid Model

May 26, 2024

Course Correcting Koopman Representations

Oct 23, 2023

On Conditional and Compositional Language Model Differentiable Prompting

Jul 04, 2023

Block-State Transformer

Jun 15, 2023

JaxPruner: A concise library for sparsity research

May 02, 2023

Interactive-Chain-Prompting: Ambiguity Resolution for Crosslingual Conditional Generation with Interaction

Jan 24, 2023

Using Graph Algorithms to Pretrain Graph Completion Transformers

Oct 14, 2022